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Abstract 

In 1967, J. Edmonds introduced the problem of computing the rank over the rational function field 
of an n x n matrix T with integral homogeneous linear polynomials. In this paper, we consider the 
non-commutative version of Edmonds’ problem: compute the rank of T over the free skew field. This 
problem has been proposed, sometimes in disguise, from several different perspectives, in the study of e.g. 
the free skew field itself (Cohn 1973), matrix spaces of low rank (Fortin-Reutenauer, 2004), Edmonds’ 
original problem (Gurvits, 2004), and more recently, non-commutative arithmetic circuits with divisions 
(Hrubes and Wigderson, 2014). 

It is known that this problem relates to the following invariant ring, which we call the ¥-algebra of 
matrix semi-invariants, denoted as R(n, m). For a field F, it is the ring of invariant polynomials for the 
action of SL(n, F) x SL(n, F) on tuples of matrices - (A, C) G SL(n, F) x SL(n, F) sends (Bi ,..., R m ) € 
A7(n,F)® m to (AB\C T ,..., ABmC T ). Then those T with non-commutative rank < n correspond to 
those points in the nullcone of R(n,m). In particular, if the nullcone of R(n,m ) is defined by elements 
of degree < a, then there follows a poly(n, a)-time randomized algorithm to decide whether the non- 
commutative rank of T is full. To our knowledge, previously the best bound for a was 0(n ■ 4™ ) over 
algebraically closed fields of characteristic 0 (Derksen, 2001). 

We now state the main contributions of this paper: 

• We observe that by using an algorithm of Gurvits, and assuming the above bound a for R(n, m) over 
Q, deciding whether or not T has non-commutative rank < n over Q can be done deterministically 
in time polynomial in the input size and a. 

• When F is large enough, we devise an algorithm for the non-commutative Edmonds problem in 
time polynomial in (n + 1)!. Furthermore, due to the structure of this algorithm, we also have the 
following results. 

— If the commutative rank and the non-commutative rank of T differ by a constant, then there 
exists a randomized efficient algorithm to compute the non-commutative rank of T. This 
improves a result of Fortin and Reutenauer, who gave a randomized efficient algorithm to 
decide whether the commutative and non-commutative ranks are equal. 

— We show that a < (n + 1)!. This not only improves the bound obtained from Derksen’s work 
over algebraically closed field of characteristic 0 but, more importantly, also provides for the 
first time an explicit bound on a for matrix semi-invariants over fields of positive characteristics. 
Furthermore, this does not require F to be algebraically closed. 
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1 Introduction 


1.1 The non-commutative Edmonds problem 

In 1967, J. Edmonds introduced the following problem f30]: let X = {x\,... ,x m } be a set of variables. 
Given an n x n matrix T whose entries are homogeneous linear polynomials from Z[X\, determine the rank 
of T over the rational function field Q(A), denoted as rk(T). The decision version of Edmonds’ problem is to 
decide whether T is of full rank or not; this decision version is better known now as the symbolic determinant 
identity testing (SDIT) problem. It is natural to consider this problem over any field F. If |F| is constant, 
this problem is NP-hard [9]. This is not the setting we are concerned with we will always assume |F| to be 
at least f2(n). 

When |F| > 2 n, the Schwartz-Zippel lemma provides a randomized efficient algorithm. To devise a 
deterministic efficient algorithm has a long history, and is of fundamental importance in complexity theory. 
Originally, the main motivation was its applications to certain combinatorial problems, most notably the 
maximum matching problem on graphs, as exploited by Tutte m, Edmonds [30], Lovasz [52!, among others Q 
Since 2003, a major incentive to study SDIT arises from its implications to circuit lower bounds, as shown 
in the wonderful work by Kabanets and Impagliazzo [48j . Improving the results in [48], Carmosino et al. 
showed that such an algorithm implies the existence of an explicit multilinear polynomial family such that 
its graph is computable in NE, but the polynomial family cannot be computed by polynomial-size arithmetic 

circuits m- 

In this paper, we study Edmonds’ problem in the non-commutative setting. In other words, we view the 
entries of T as elements of F(A), the algebra of non-commutative polynomials over F. To state this, we 
need a non-commutative counterpart of the rational function field. Note that, due to non-commutativity, 
the best we can hope for is a skew field (a.k.a. a non-commutative field or a division ring). The free skew 
field is the non-commutative analogue of the rational function field. For T a matrix with homogeneous linear 
polynomials, Fortin and Reutenauer |33| defined the non-commutative rank of T, denoted as ncrk(T), as its 
rank over the free skew field. By the non-commutative Edmonds problem we mean the problem of computing 
ncrk(M), and by the non-commutative full rank problem (NCFullRank) we mean the problem of deciding 
whether ncrk(M) is full or not. 

Since we will not define the free skew field formally, we provide a definition of ncrk(T) that is due to 
Cohn [ 116] : ncrk(T) is the minimum s £ Z + s.t. T can be written as PQ 1 where P and Q are matrices 
with homogeneous linear polynomials from F(X), and of size n x s and s x n, respectively. It may also be 
worthwhile to recall that, since every module over a skew field is free, the row (resp. column) rank of a 
matrix over a skew field can be defined as the rank of the module generated by the rows (resp. columns). 
Then just as the case of matrices over fields, it can be shown that row rank and column rank are equal, so 
either of them defines the rank of a matrix over a skew field. 

The free skew field was first constructed by Amitsur [3] , and alternative constructions were subsequently 
given by Bergman [5] , Cohn [15] , and Malcolmson [54] . We refer the reader to [43] by Hrubes and Wigderson 
for a nice introduction to the free skew field from the perspective of algebraic computations. Cohn’s books 
pT5[;l6] serve as a comprehensive introduction to this topic. 

It will be clear soon that rk(T) < ncrk(T), and Fortin and Reutenauer showed that ncrk(T) < 2rk(T), 
and exhibited an example T for which ncrk(T) = 3/2-rk(T) [33j . In |17j . Cohn and Reutenauer presented an 
algorithm to decide whether ncrk(T) is full or notQ which puts this problem in PSPACE since it reduces to 
testing the solvability of a system of multivariate polynomial equations. Unlike its commutative counterpart, 
it is not even clear that the non-commutative Edmonds problem has a randomized efficient algorithm. In 
Section OI we will discuss a natural randomized algorithm for NCFullRank, but its efficiency will depend 
on an invariant-theoretic quantity. 

x In these applications, T is usually of certain specific forms, for example, as a mixed matrix: each entry is a single variable 
or a field element, and each variable appears only once. 

2 As remarked in m, this algorithm can be generalized to compute ncrk(T). 
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1.2 Equivalent formulations of the non-commutative Edmonds problem 

Like Edmonds’ problem, its non-commutative counterpart also has a long history, though in the literature, 
it often stated in a very different way. In 1973, Cohn first studied this problem from the perspective 
of understanding the free skew field, and showed it to be decidable in Ham- In 2003, Gurvits posed 
this problem in his remarkable work on Edmonds’ problem fl!)’ . Recently, Hrubes and Wigderson arrived 
at this problem in their study of non-commutative arithmetic circuits with divisions |43j . Indeed, a very 
intriguing feature of the non-commutative Edmonds problem is the existence of several interesting equivalent 
formulations. Instead of relying on the free skew field, these formulations use either linear algebra, or concepts 
from invariant theory, or quantum information theory. They are scattered in the literature, so we collect 
them here, to illustrate the various facets of this problem, introduce some previous works, and motivate the 
study of the non-commutative Edmonds problem. 

To state these formulations we need some notations. M(n, F) denotes the linear space ofnxn matrices 
over F. A linear subspace of M ( n , F) is called a matrix space. Given T, a matrix of linear forms in variables 
X = {xi, ..., x m }, write T = x\Bi + x 2 B 2 + • • • + x m B m where B t G M(n, F). Let B := (Pi,..., B m ), 
where (•} denotes linear span. The rank of B , denoted as rk(B), is defined as max{rk(P) | B £ B}. We call 
B singular , if rk(B) < n. When |F| > n, as we will assume throughout, rk(T) = rk(B)0 We shall soon see 
that ncrk(T) corresponds to some property of B as well, so that we can translate the study of commutative 
and non-commutative ranks of T entirely to the study of B. 

Some of these formulations make sense only subject to certain conditions. In such cases we indicate the 
conditions needed before that formulation. 

1. Given B = (Bi ,..., B m ) < M(n, F), a subspace U < F" is called a c-shrunk subspace of B , for c € N, 
if there exists W < F", such that dim (W) < dim(t/) — c, and for every B G B, B{U) < W. U is called 
a shrunk subspace of B, if it is a c-shrunk subspace for some cGZ + . 

Question: compute the maximum c such that there exists a c-shrunk subspace. 

Remark: Cohn showed that the non-commutative rank is not full if and only if there is a shrunk 
subspace US- This was generalized by Fortin and Reutenauer |33t Theorem 1] who showed a precise 
relationship between non-commutative rank and the existence of c-shrunk subspaces. Their motivation 
to consider this problem was to connect matrices over linear forms on the one hand, and matrix spaces 
of low rank on the other. The latter topic was studied in e.g. mm- By ,33], we can define the 
non-commutative rank of B as 

n — max{c G {0,1,..., n} | 3c-slrrunk subspace of B }, 
and it follows that ncrk(P) = ncrk(T). So we may (and do) identify T with B in the following. 

2. (F is large enough) Given B = (Pi,..., B m ) < M(n, F), the dth tensor blow-up of B , is B^ := 
M(d, F) (g) B < M(dn, F). It is clear that rk(p[ d l) > d • rk (B). We shall prove that when F is large 
enough, then d always divides rk(pl d l). Furthermore, when d > n, then rk(p[ d+1 ])/(d+l) > rk (B^)/d. 
See Lemma T5.71 Corollary 15.81 and Remark 14.21 

Question: compute lim^oo ik(B^)/d. 

Remark: That NCFullRank is equivalent to deciding whether rk(B^) = nd for some d was shown 
by Hrubes and Wigderson |43] . Our formulation here is a straightforward quantitative generalization 
of their statement. Hrubes and Wigderson’s motivation was to study non-commutative arithmetic 
formulas with divisions. 

3. (F = C) Given B\,... ,B m G M(n, Q), construct a completely positive operator P : M(n,C) —»• 

M(n,C), sending A —> ^iAB\. For c G N, P is called rank c-decreasing, if there exists a 

positive semidefinite A , such that rk(A) — rk(P(A)) = c. 

3 As when the field size is large enough, the complement of the zero set of a nonzero polynomial is non-empty. 
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Question: compute the maximum c such that P is rank c-decreasing. 

Remark: Gurvits stated the problem of deciding whether P is rank non-decreasing or not [3S]. His 
original motivation was to study Edmonds’ original (commutative) problem, and the main result in [301 
solves the case when the commutative and non-commutative rank coincide (see Theorem 13.II) . 

4. (NCFullRank) Consider the action of (A, C) £ SL(n, F) x SL(n, F) on a tuple of matrices (Hi,..., B m ) £ 
M(n, F)® m by sending it to (HHiC t , ..., AB m C T ) O Let R(n , m) be the F-algebra of invariant poly¬ 
nomials with respect to this action. The nullcone of R(n, to) is the common zero of all homogeneous 
positive-degree polynomials in R(n,m). 

Question: decide whether or not (Hi,..., B m ) is in the nullcone of R(n, to). 

That the original formulation is equivalent to (1) comes from [33]. The equivalence between (1) and (3) is 
straightforward. The equivalence among decision versions of (1) and (2), and (4) can be obtained via the 
ring of matrix semi-invariants, as described in Section ll.3l One way to prove the equivalence between (1) 
and (2) is via Theorem 15.111 

To summarize, the non-commutative Edmonds problem can be derived naturally from the perspectives 
of quantum information theory^ and invariant theory. It is of great interest in non-commutative algebraic 
computation with divisions, and in the study of matrix spaces of low rank. Our motivation to study this 
is because a solution to the non-commutative Edmonds problem will throw light on its commutative coun¬ 
terpart. Shrunk subspaces form a natural and important witness for the singularity of a matrix space. 
Therefore, if the non-commutative Edmonds problem can be solved deterministically in polynomial time, it 
means that, for SDIT, the bottleneck lies in recognizing those singular matrix spaces without such witnesses. 
This connection will be detailed in Section [3] 

1.3 Matrix semi-invariants 

Formulation (1), (2) and (4) have a common origin, namely the invariant ring R(n,m) described in (4). We 
shall call R(n, m) the ring of matrix semi-invariants , as (1) it is closely related to the classical ring of matrix 
invariants [63] (see below for the definition, and pQ!23] for the precise relationship between these two rings); 
and (2) it is the ring of semi-invariants of the representation of the m-Kronecker quiver with dimension 
vector (n, n). Here, the m-Kronecker quiver is the quiver with two vertices s and t , and m arrows pointing 
from s to t. When to = 2, it is the classical Kronecker quiver. The reader is referred to [23127, 66 for a 
description of the semi-invariants for arbitrary quivers. 

The equivalence between (1) and (4) comes from the observation that the (Hi,..., B m ) with a shrunk 
subspace are exactly the points in the nullcone of R(n,m) pQd • The equivalence between (2) and (4) can 
be seen from the first fundamental theorem (FFT) of matrix semi-invariants [3123 GZ] EE]- To describe 
this we need some notations: for n £ N, [n] := {l,...,n}. Note that R{n,m) C F[a;.^] where i,j £ [n], 

k £ [to], and x\ k J are independent variables. Let Xk = (%i k j)i,je[ n ] be a matrix of variables. Then for 
Hi,..., A m £ M\d, F), det(Hi ® Xi + ■ ■ ■ + A m ® X m ) is a matrix semi-invariant, and every matrix semi¬ 
invariant is a linear combination of such polynomials. Therefore, (Hi,..., H m ) is in the nullcone, if and only 
if for all d £ Z + and all (Hi,..., H m ) £ M(d, F)® m , Hi <8> Hi -I— • + H m ® H m is singular. 

It is well-known that the matrix semi-invariant ring is finitely generated, by Hilbert’s celebrated work 1421 . 
This implies that there exists some integer d such that those matrix semi-invariants of degree no more than 
d define R(n,m). This motivates the following definition. 

Definition 1.1. /3(R(n,m)) is the smallest integer d such that R(n,m) is generated by invariants of degree 

< d. 

4 This action can also be written as: ( A , C) sending (Bi,...,B m ) to (AB\C ~ 1 ,..., AB m C ~ 4 5 ). We adopt the transpose 
rather than the inverse, as the transpose yields a polynomial representation rather than a rational representation. Furthermore 
when Derksen’s result is applied to the transpose, it gives a somewhat better bound (Fact. 11.21 . 

5 It remains to investigate the physical meaning for a super-operator to be rank non-decreasing though. 
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An explicit upper bound on f3(R(n,m)) turns out to be particularly interesting for the purpose of the 
NCFullRank problem. As already suggested by Hrubes and Wigderson [43], if R(n,m ) has a degree bound 
/3 = f3(R{n y to)), one can do the following: take m d x d variable matrices Y ly ..., Y m , Yk = {y^J) and form 
the polynomial 

det(Y) <§) Bi + ■ ■ ■ + Y m B m ) G (1-1) 

Letting d go from 1 to (3, this system of polynomials characterizes ncrk(T) < n: ncrk(T) < n if and only if all 
these polynomials are the zero polynomial. This immediately gives a randomized algorithm for NCFullRank 
over large enough fields, with time complexity poly(n,/3). 

In fact, for the above application, what really matters is another important bound a = <r(R(n y m)). This 
is defined as the minimum integer d with the property that (B i,..., B m ) G M(n, F)® m is in the nullcone if 
and only if all polynomials of degree < d in R(n , to) vanish on {B\, ..., Bm}. It is clear that a < /3, and the 
above reasoning goes through when (3 is replaced by a. 

Over algebraically closed fields of characteristic 0, by directly employing Derksen’s bounds for invariant 
rings satisfying certain general conditions US, the following bound can be derived. For completeness we 
include a proof in Appendix lAl 

Fact 1.2 ( [191 ). Over algebraically closed fields of characteristic 0, for R(n,m), /3 < max{2,3/8 • n 4 • cr 2 }, 
and a < 1/4 • n 2 • 4 n2 . 

In particular, if cr is polynomial in n, then (3 is polynomial in n as well. 

It is generally believed that over fields of characteristic 0, the bounds we get for R(n y to) using Derksen’s 
theorem is far from optimal. One reason to believe so is that R(n, m) is closely related to another ring 
of invariants: let A G SL(rc,F) act on (Bi ,..., B m ) G M(n, F)® m by simultaneous conjugation i.e. A 
sends the tuple to (A£>iA _1 ,..., AR m A -1 ). Denoted by S(n,m ), this is just the classical ring of matrix 
invariants [53]. The structure of S(n,m) is well-understood. Over fields of characteristic 0, the first and 
second fundamental theorems for S(n, to), and an n 2 upper bound for /3(S(n, to)) were established in 1970’s, 
by the works of Procesi, Razmysolov, and Formanek [32163164] . See [M for the precise relationship between 
the rings R(n,m) and S(n,m). Note that when applied to S(n,m) over characteristic 0, Derksen’s bound 
yields /3(S(n, to)) < max{2, 3/8 • n 2 ■ a 2 } and cr(S(n, to)) = n°( n \ far from the n 2 bound given above. 

Another reason to believe that Derksen’s bounds are far from optimal is that for certain small m or n , 
explicit generating sets of R(ir,m) have been computed in e.g. [221l23j|^; j i37l. In these cases, elements of 
degree < n 2 generate the ringjj 

If we turn to positive characteristic fields then, to our best knowledge, no explicit bounds for f3(R(n, to)) 
nor cr(R{n,m)) have been derived. Note here that the relation between (3 and cr as in Fact 11.21 is not 
known to hold, due to the assumption on the field properties there. This case is important, for example, in 
the application to identity testing, and division elimination for non-commutative arithmetic formulas with 
divisions over fields of positive characteristics [43j. For S(n,m) over fields of positive characteristics, the 
FFT was established by Donkin in [281129] , Over fields of positive characteristic an 0(n 3 ) upper bound for 
cr(S(n, to)) can be derived from [12] Proposition 9], and in [24lf26] Domokos proved an upper bound 0(n 7 m n ) 
on j3(S(n, to)). 

1.4 Our results 

In the previous sections, we defined the non-commutative Edmonds problem and the NCFullRank problem, 
and illustrated their connections to matrix semi-invariants. Indeed, our results suggest that progress on one 
topic helps to advance the other as well. 

The first result shows that an upper bound for a(R(n,m)) actually implies a deterministic algorithm for 
NCFullRank over Q, rather than just a randomized one as in Section [1731 


6 We thank M. Domokos for pointing out this fact to us. 
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Proposition 1.3. Over Q, if the nullcone of R(n,m) is defined by elements of degree < a = a(n,m), then 
there exists a deterministic algorithm that solves NCFullRank with bit complexity polynomial in a and the 
input size. 

In particular, if a is a polynomial in n and m, then NCFullRank can be solved deterministically in 
polynomial time over Q. The key ingredient here is Gurvits’ algorithm for the Edmonds’ problem, although 
that algorithm works only under a promise [35]. The distinction between deterministic and probabilistic is 
important: as illustrated at the end of Section [13 our original motivation of studying NCFullRank is to 
gain an understanding of SDIT, for which the question is to devise deterministic efficient algorithms. 

Our main result is an algorithm that solves the non-connnutative Edmonds problem using formulation 
(2). To ease the presentation, we give an informal statement of the main theorem in Sectionl5liTheorem l5.nl) 
here, and discuss its two consequences. 

Theorem 1.4 fTheorem l5.HI informal). Given a matrix space B < M(n, F) over a large enough field, there 
exists a deterministic algorithm that computes rk (B) using poly((n + 1)!) many arithmetic operations. Over 
Q the algorithm runs in time polynomial in the bit size of the input and (n + 1)!@ 

In [531 . Fortin and Reutenauer asked for “an algorithm which uses only linear-algebraic techniques.” 
Indeed, the algorithm for Theorem 11.41 may be viewed as one, though it relies on certain routines dealing 
with objects from cyclic field extensions and division algebras. 

Two interesting consequences now follow. Firstly, we have a randomized efficient algorithm to compute 
the non-connnutative rank if it differs from the commutative rank by a constant. (Recall that rk(I?) < 
ncrk(£>) < 2rk(£>).) Its easy proof is put after the statement of Theorem 15.Ill 

Corollary 1.5. For B < M(n, F), let c = ncrk(S) — rk (B), and assume F is of size f l(n ■ (n + 1)!). Then 
the non-commutative rank of B can be computed probabilistically in time polynomial in (n+ l) c+1 . 

Secondly, we immediately obtain an explicit bound for a(R(n, m)) as a consequence of Theorem l5.11l By 
Fact 11.21 we also get a bound on f3(R(n,m)), over an algebraically closed field of characteristic 0. Its proof 
is also put after Theorem 15. Ill 

Corollary 1.6. Over any field F of size fl(n ■ ( n + 1)!), a(R(n,m)) < [n + 1)!. If furthermore F is of 
characteristic 0 and algebraically closed, then /3(R(n, m)) < max{2, 3/8 • n 4 • ((n + l)!) 2 }. 

This improves the bounds in Fact 11.21 over algebraically closed fields of characteristic 0. More impor¬ 
tantly, to the best of our knowledge, this provides an explicit bound for a(R(n,m)) over fields of positive 
characteristic for the first time. Furthermore to get this bound we only assume the field size to be large 
enough, whereas Fact 11.21 requires our held to be algebraically closed. 

While the improvement from 2°(" ^ to 2°( nlosn ' > is modest, we believe it is nonetheless an interesting 
improvement from the technical point of view: note that the dimension of SL(n,F) is n 2 — 1. In the line of 
research for bounds of an invariant ring R with respect to a group G (cf. [191162] V the dimension of G has 
to stand on the exponent for a(R), and to get a bound as 2°f dlm ( G b seems difficult there. Furthermore, the 
idea of using correctness of algorithms to get bounds on quantities of interest in invariant theory seems new, 
and may deserve to be explored further. 

We also obtain certain structural results for R(n,m), which are reported in [47] . 

1.5 More previous works 

Connections between invariant theory and complexity theory. The results in this paper suggest a new link 
between invariant theory and complexity theory. Connections between the two fields have been emerging in 
recent years. We have already alluded to the direct connection with non-commutative arithmetic circuits, in 

‘ All algorithms presented in this paper, when working over Q, have bit complexity polynomial in the input size, and some 
additional parameters. Sometimes, we may omit the input size but focus on those more important parameters. 
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the work of Hrubes and Wigderson [33] above. In a series of papers titled geometric complexity theory (GCT) 
[55J[50] (see also |U[57]), Mulmuley and Sohoni pointed out possible deep connections between problems in 
invariant theory and complexity theory. GCT addresses the fundamental lower bound problems in complexity 
theory, e.g. the permanent versus determinant problem, by linking them to problems in representation 
theory and algebraic geometry. In particular, in ]5'8] . Mulmuley established a tight connection between 
derandomizing the Noether normalization lemma, and black-box derandomizing the polynomial identity 
test. The degree bounds of various invariant rings are of central importance in that work. We briefly remark 
that a polynomial bound for /3 (R(n,m)), if proven, will yield similar results as what the n 2 degree bound 
for S(n,m) has yielded in [551 . 

More previous works on Edmonds ’ problem. Some earlier work on this problem was cited at the beginning 
of this article. Here we mention more related work. 

An interesting instance of Edmonds’ problem is the module isomorphism problem. Specifically, assume 
that we are given two n-dimensional modules U and U' for the free algebra A over F with k generators 
as fc-tuples Gi,... ,Gk and G\...., G' k of n by n matrices. Then Hornet/, U') is the F-linear subspace of 
Hornet/, U'), identified with M(n,F), consisting of matrices X with XGi = G\X (i = 1,..., k). As these 
conditions are linear in the entries of X , the space Hom _4 (U,U') can be obtained by solving a system of 
homogeneous linear equations in n 2 elements. Furthermore, U' is isomorphic to U if and only if there exists a 
nonsingular matrix in Hornet/, U'). In turn, any such nonsingular matrix witnesses an isomorphism and, by 
the Schwartz-Zippel lemma, for sufficiently large base field a random homomorphism will be an isomorphism. 
Due to the special algebraic structure behind this problem, it can be solved even by deterministic polynomial¬ 
time methods, see the method of Chistov, Ivanyos and Karpinski m working over many fields, or a different 
approach of Brooksbank and Luks [6] which works over arbitrary fields, and an extension of the first method 
to arbitrary fields given by Ivanyos, Karpinski and Saxena in |45| . Interestingly, the general case of finding 
a surjective or injective homomorphism between non-isomorphic modules deterministically turns out to be 
as hard as the constructive version of Edmonds’ general problem [45]. 

Recall that one motivation to study Edmonds’ problem is due to its implications to certain combinatorial 
problems. This line of research mostly focuses in the case when the given matrices are of particular form, e.g. 
rank-1 and certain generalizations |36ll4lH45ll61l as used in bipartite graph matchings, or skew symmetric 
rank-2 and certain generalizations jSSJEjOSBi as used in general graph matchings. 

Another line of research deals with matrix spaces that satisfy certain properties. Note that properties 
of matrix spaces should not depend on a particular basis. For example, we can define a property of matrix 
spaces as, “having a basis consisting of rank-1 matrices.” So if B has a basis consisting of rank-1 matrices, B 
may not necessarily be presented using this rank-1 basis. We are not aware of any result on the complexity 
of finding rank-1 generators for rank-1 spanned matrix spaces, if it is given by a basis consisting of not 
necessarily rank-1 matrices. We believe that the problem is hard. Thus the results in [SSSDHSIISII , which 
assume that the input is given by a rank-1 basis, do not translate to algorithms for rank-1 spanned matrix 
spaces. 

As far as we are aware, there are two references for SDIT which assume only properties of matrix 
spaces. The first one is Gurvits’ algorithm in [39]; this algorithm works over Q, and assumes a property 
which Gurvits called “Edmonds-Rado.” His algorithm, put in the context of this paper, is rephrased as 
Theorem o Gurvits left open the problem of developing a deterministic efficient algorithm for rank-1 
spanned matrix spaces over finite fields. This was settled in affirmative in [33] , the other reference that 
assumes properties of matrix spaces. 

Recall that the other major incentive to study Edmonds’ problem is to understand arithmetic circuit 
lower bounds via [T01I48] . We believe that for this goal, a better indication of progress is to use properties 
of matrix spaces, rather than properties of the given matrices. One reason is that, whether a matrix space 
contains a nonsingular matrix, is a property of matrix spaces. Another reason is that many properties of 
matrix spaces seem difficult to test algorithmically. Furthermore, note that in this paper we heavily rely on 
algorithmic techniques developed in [39] and [44] . This may be viewed as another evidence of the importance 
of working with properties of matrix spaces. 
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Connections to Kronecker coefficients. Recently, there was an interest in studying the semi-invariants of the 
m-Kronecker quivers due to its connection with the Kronecker coefficients [Dina, namely the multiplicities 
in the direct sum decompositions of the tensor products of two irreducible representations of symmetric 
groups. Giving a positive combinatorial description of these coefficients is considered to be one of the most 
important problems in the combinatorial representation theory of symmetric groups. 

1.6 Update on recent progress 

There have been some exciting developments since we posted a version of this paper on the arXiv. 

First, Garg et al. presented a deterministic polynomial-time algorithm for computing the non-commutative 
rank over Q ,31]. This is achieved via a closer analysis of Gurvits’ algorithm [35]. Their analysis uses the 
exponential bounds on a(R(n 1 to)) as proved in this paper, or deducible from Derksen’s result JTS]. It should 
be noted that that the given algorithm is not constructive, in that it fails to produce a witness (e.g. shrunk 
subspaces). 

Second, Derksen and Makam proved that er < v? — n over large enough fields m- Over fields of 
characteristic zero this implies /3(R(n,m)) = 0(n 6 ), settling the question of whether there is a polynomial 
degree bound on the generators for this ring of invariants. To prove the upper bound on cr(R(n, to)), Derksen 
and Makam discover a concavity property of blow-ups, and rely crucially on Lemma l5.6l proved in this paper. 

After 2D] appeared, in we show that the technique of Derksen and Makam can be constructivized. 
By combining that with the techniques in this paper, we obtain a constructive deterministic polynomial-time 
algorithm for computing the non-commutative rank over large enough fields. There we also present another 
independent proof of a(R(n,m)) < n 2 + n. That argument also builds on Lemma |5.6I from this paper and 
is much simpler than the concavity argument of Derksen and Makam. 

Organization. In Section[2]we present certain preliminaries. In Section[3]we give an exposition of the natural 
connection between commutative and the non-commutative Edmonds problem, and prove Proposition 11.31 
In Section[4]we present an efficient construction of division algebras, to be used in proving the main technical 
lemma Lemma 15.41 . In Section [5] we prove the formal version of Theorem 11.41 (Theorem 15.111) and deduce 
Corollary 11.51 and 11.61 

2 Preliminaries 

2.1 Notation 

For the reader’s convenience, we collect the main notations in this section. Some of these were already 
introduced in the introduction. 

For n £ N, [n] := {1,..., n}. Given two vector spaces U and V, U < V denotes that U is a subspace of 
V. 0 denotes the zero vector or the trivial vector space. 

Let F be a field. char(F) denotes the characteristic of F. M(n, F) is the linear space of n x n matrices 
over F. The rank of A € M(n,F) is denoted rk(A). The corank of A, cork(A), is n — rk(A). For U C F”, 
A -1 (f7) = {« S F 11 A(v) £ U}. I denotes the identity matrix. 

A linear subspace of M(n,F) is called a matrix space. For Bi £ M(n, F), * £ [to], (!?i, ..., B m ) denotes the 
matrix space spanned by the Bfs. For a matrix space B < M(n , F), rk (B) is defined as max{rk(I?) | B £ B}. 
We call B singular , if rk(S) < n. For B < M(n, F) and U < F", B(U ) := {VJb^bB{U)). The non-commutative 
rank, ncrk(S), is n — c where c is the maximum integer such that there exists a c-slirunk subspace. 

For A £ M(d, F) and B £ M(n,F), the tensor product A®Bisadxd block matrix with block size 
n x n. For A = (ai,j)ije[d], the (i,j)-th block of A <S) B is aijB. For B = (£?i,... ,B m ) < M(n, F), the dth 
tensor blow-up , B^ := M(d, F) ®B< M(dn, F). A linear basis of B^ is {E it j (g) Bk \ i,j £ [d], k £ [to]}, 
where Eij is the matrix with 1 at the (i, j)th position, and 0 otherwise. In Section [5] it will be easier to work 
with B ,yd } := B ® M(d. F). As B^ = B^ d \ the latter will also referred to as the dth tensor blow-up of B. 




2.2 The second Wong sequences 

Let us introduce a key tool to be used in Section^ called the seconcfl (generalized) Wong sequence. This was 
used by Fortin and Reutenauer [33], and rediscovered by the first two authors with Karpinski and Santha 
in |44j to solve Edmonds’ problem for rank-1 spanned matrix spaces over arbitrary fields. 

Given A £ M(n, F) and B < M(n, F), the second Wong sequence of ( A,B ) is the following sequence 

of subspaces in F”: Wo = 0, W\ = S(A _1 (Wo)), ..., W* = B(A~ 1 {Wi-\)), _ It can be proved that 

Wo < Wi < W 2 < • • • <Wi = Wi +1 = ... for some I £ {0,1,..., n}. We. is then called the limit of this 
sequence, denoted as W*. 

A useful way to understand the second Wong sequence is to view it as a linear algebraic analogue of the 
augmenting path on bipartite graphs. While not precise, we find this intuition helpful. That is, we view 
matrices as linear maps from V to W, V = W = F ra . Vectors in V and W may be thought of as the “vertices” 
on the left and right part, respectively. Then for A £ B, thinking of A as a given matching, A _1 (0) can be 
understood as identifying those “vertices unmatched by A on the left part.” Then £>(A _1 (0)) is understood 
as taking those “edges” outside A, and A~ 1 (B(A~ 1 ( 0))) is understood as taking a further step with those 
“edges” in A. And so on. 

The key fact is that, when A £ B, W* < im(A) if and only if there exists a cork(A)-shrunk subspace [441 
Lemma 9] (reproduced below as Fact 12. l|) . If this is the case, A is of maximum rank and A -1 (W*) is a 
cork(A)-shrunk subspace. It is clear that the second Wong sequence can be computed using polynomially 
many arithmetic operations. The direct way to compute the second Wong sequences over Q may cause the 
bit lengths to explode. If testing whether W* < im(A) is the only concern (as in our application here), by 
replacing A -1 with some appropriate “pseudo-inverse” of A, the bit lengths of the intermediate numbers 
up to the first Wk, Wk ^ im(A), can be bounded by a polynomial of the input size. We refer the reader 
to [44] Lemma 10] for this trick. 

When |F| is f2(n), this immediately gives a method to decide whether ncrk(Z?) = rk {B) as in [33]: randomly 
choose a matrix A £ B, which will be of maximal rank with high probability. Then compute the second 
Wong sequence of (A,B) and check whether the limit W* C im(A). 

For completeness we summarize the above discussion together as a fact. 

Fact 2.1 ( 231 Lemmas 9 and 10]). Let A £ B < M(n, F), and let W* be the limit of the second Wong 
sequence of (A, B). Then there exists a cork (A)-shrunk subspace of B if and only if W* C im(A). If this is 
the case then A -1 (IV*) is a cork(A) -shrunk subspace of B. In the algebraic RAM model as well as over Q 
we can detect whether W* C im(A) and if so compute a shrunk subspace in deterministic polynomial time. 

For a matrix space B of dimension 2, rk (B) = ncrk(S) for large enough F; this follows from the Kronecker- 
Weierstrass theory of matrix pencils, and alternative proofs may be found in 0EH. Due to this fact, it was 
observed in [44] that by utilizing the second Wong sequence we have the following. 

Fact 2.2 ( |441 Fact 11]). Assume that |F| > n, and let B = ( A,B) < M{n, F). Then rk(A) = rk(H) if and 
only if for any i £ [n], (BA~ 1 ) l (0) < im(A). 


3 Gurvits’ algorithm and Proposition 11.3 

Commutative and the non-commutative Edmonds problem: a natural pair. Viewing matrices as linear maps 
between two vector spaces, one may suspect Edmonds problem to be a linear algebraic analogue of the 
maximum matching problem on bipartite graphs, with elements of the underlying vector spaces as being the 
left and right side vertices, and the matrices as giving us edges - mapping a vector on the left side to one on 
the right side. Given such a correspondence, one may ask whether an analogue of Hall’s theorem holds in this 
setting, i.e., is it true that a matrix space either has a matrix of rank s, or has an (n — s + l)-shrunk subspace; 

8 The first Wong sequence is the dual of the second one; this naming convention is due to Wong who in ES defined the two 
sequences for the special case B = (B). 
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or put differently, whether rk (B) = ncrk(£>) holds for all B. This is far from the truth! For example, for the 
space of skew-symmetric matrices (A = —A T ) of size 3, we have ncrk = 3 and rk = 2. 

That is, while for the bipartite maximum matching problem, matchings and shrunk subsets are two 
sides of the same coin, in the linear algebraic setting, this coin splits into two problems: Edmonds’ original 
(commutative) problem asks to compute the maximum rank, and the non-commutative Edmonds problem 
asks to compute the maximum c for the existence of a c-shrunk subspace. 

Rank-1 spanned matrix spaces. Now we point out that several results on the (commutative) Edmonds problem 
can be viewed, and should be understood as, resolving the non-commutative counterpart. For this, note that 
shrunk subspaces are a natural witness for the singularity of matrix spaces: this construction can be dated 
back to 1930’s in T. G. Room’s book [65| . and plays a key role in several results which solve special cases of 
Edmonds’ problem including [5911441152] . 

A particular case of interest is rank-1 spanned matrix spaces: those matrix spaces that have a basis 
consisting of rank-1 matrices. For rank-1 spanned spaces, the analogue of Hall’s theorem holds [55], so 
the commutative and the non-commutative Edmonds problems coincide Therefore, the known results for 
rank-1 spanned spaces [591IU can be viewed as solving either the non-commutative Edmonds problem or 
the commutative one. In retrospect, the results for rank-1 spanned spaces rely on shrunk subspaces in such 
a critical way that they should be understood as solving NCFullRank rather than the commutative version 
for this special case: 

• The core of Gurvits’ algorithm [39] is an iterative procedure called the operator Sinkhorn’s scaling 
procedure. When applied to a matrix space B, this procedure converges, if and only if B has a shrunk 
subspace. 

• The key tool in [5J is the second Wong sequence as described in Fact 12.11 When applied to A £ 
B < M(n, F), this sequence stabilizes in polynomially many number of steps, and the limit subspace 
is contained in im(A) if and only if B has an cork(A)-shrunk subspace. 

Gurvits’ algorithm; Proof of Provosition \77^[ In fact, Gurvits’ algorithm works by assuming that an analogue 
of Hall’s theorem for perfect matchings holds. 

Theorem 3.1 ( [39))■ Over Q, given a matrix space B such that either rk (B) = n or ncrk(,B) < n, there 
exists a deterministic polynomial-time algorithm that solves SDIT, and therefore NCFullRank. 

Gurvits’ algorithm almost solves NCFullRank over Q. The only problem is that for a matrix space B 
with n = ncrk(yB) > rk(Z?) the algorithm may give a wrong answer. (We note that even when the input to 
the algorithm is such a matrix space it terminates in polynomially many steps.) We observe that this can 
be rectified by considering matrix semi-invariants up to the upper bound for a(R(n,m)). 

Proof of Provosition \l.A Recall that, by assumption, the nullcone of R(n,m) is defined by elements of 
degree < a(R(n, m)) over Q. Also, given a matrix space B £ M(,s, Q), Gurvits’ algorithm either reports that 
rk(,B) = s, or ncrk(£J) < s. When rk(£?) = s or ncrk(£?) < s, it is always correct. 

The algorithm is easy to describe: for d = 1,..., a = cr(R(n, to)) run Gurvits’ algorithm with input B^ d \ 
If for some d, Gurvits’ algorithm reports rk(£>I d l) = dn, then output ncrk(S) = n and halt. Otherwise, return 
ncrk(S) < n. 

It is clear that this algorithm runs in time polynomial in the input size and a. Note that a linear basis 
of B^ can be constructed easily in time polynomial in the input size of B and d. 

From the discussion in Section o the correctness is also easy to see. Specifically, assuming the bound 
on a, the simultaneous vanishing of det(Yi ® B\ + • • • + Y m ® B m ) £ ] for d = 1,..., a, characterizes 

whether ncrk(,B) < n or not. Therefore, if ncrk(Z?) = n, then for some d < c r, rk)!?^) is full. On the other 
hand if ncrk(S) < n, then there is a shrunk subspace U. For each d, Q d ® U is a shrunk subspace of B^ 
and so ncrk(£j[ d l) < dn for any d. □ 
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Implications to Gurvits ’ algorithm. The invariant-theoretic viewpoint also connects to a question of Gurvits 
in j39]. In [39], given a basis {Si,..., B m } of B < M(n, C), Gurvits associates with it a completely positive 
operator i.e., a linear map F : M (n, C) —> M(n,C). The main algorithmic technique is the so-called operator 
Sinkhorn’s iterative scaling procedure, which is applied to F. This procedure is a quantum generalization 
of the classical Sinkhorn’s iterative scaling procedure, which is applied to nonnegative matrices, and can 
be used to approximate the permanent, and to decide the existence of perfect matchings [IJJJjSTj. Gurvits 
proved that this procedure, when applied to the operator T derived from a matrix space S, converges if and 
only if B has a shrunk subspace. He proved this using a continuous but non-differentiable function, called 
the capacity of an operator, denoted as Cap(F). Specifically, he showed that B has a shrunk subspace if and 
only if Cap(-F) = 0. Gurvits asked whether there exists a “nice” function, like a polynomial with integer 
coefficients, that characterizes B with shrunk subspaces. Our previous argument suggests that there exists 
a set of polynomial functions with integer coefficients, whose simultaneous vanishing characterizes those B 
with shrunk subspaces, and therefore a “nice” substitute for Gurvits’ capacity. However, the number of 
these polynomial functions depends on the degree bound for matrix semi-invariants. 


4 Efficient construction of division algebras 

Division algebras, and efficient construction of such algebras with explicit matrix representations play a 
crucial role in the main technical lemma, Lemma 1571 in this paper. In this section we present an efficient 
construction of such algebras based on Kummer extensions. 

4.1 Basic facts about central division algebras 

Let us first introduce some basic facts about central division algebras. Proofs of the these statements can 
be found in [50]. 

Let F be a field. A division algebra D over F is an associative F-algebra in which the non-zero elements 
are invertible. The center of a division algebra D over F is obviously an extension field of F. All the division 
algebras considered in this section are finite dimensional over their center. The opposite division algebra D op 
is the algebra with the same set of elements as D and with multiplication x ■ y defined to be y * x, with * 
being the multiplication in D. 

When the center coincides with F, we say that D is a central division algebra over F, and in this case, 
diniF(D) = d, 2 for some positive integer d. This d is called the index of D. For x,y G D we can consider 
the linear transformation p x ^ y on D, considered as a vector space of dimension d 2 over F, defined as p X) yZ = 
x*z*y. The linear extension of the map x®y H > n XtV to D®D op gives an isomorphism D®D op = M(d 2 , F), 
the algebra of d 2 x d 2 matrices with entries in F. The image of D Cg) / under this isomorphism gives a 
representation of D in the space of d 2 x d 2 matrices with entries in F. What is important for us is the 
observation that matrices giving us a representation of D op in M(d 2 , F) commute with the matrices giving 
us a representation of D in M(d 2 1 F). 

4.2 Constructing cyclic field extensions under a coprime condition 

Our division algebras will be cyclic algebras, that is, non-commutative algebras constructed from cyclic 
extensions of fields. In this subsection we present an efficient construction of such field extensions, under the 
condition that the extension degree and the field characteristic are coprime. 

Recall that a cyclic extension of a field K is a finite Galois extension of K having a cyclic Galois group. 
By constructing a cyclic extension L we mean constructing the extension as an algebra over K, e.g., by 
giving an array of structure constants with respect to a K-basis for L defining the multiplication on L as 
well as specifying a generator of the Galois group, e.g, by its matrix with respect to a K-basis. Recall that 
for a finite dimensional algebra A over the field K, a common way to specify the multiplication is using an 
array of structure constants with respect to a K-basis Ai,..., Ad- These are d 3 elements jijk of K such that 
AiAj = Efc=i li-jk.Ak- Then we can represent elements of A by the vectors of their coordinates in terms of 
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the basis Ai, ..., Ad . The size of the data representing the structure constants gives some control over the 
size of the data representing the product of elements. For example, consider the following situation: K is 
the function field F '(Z), where F' is a field and Z a formal variable. The structure constants happen to be 
polynomials in F'[Z]. Then for two elements of A with their coordinates being polynomials in F'[Z], their 
product will have also polynomial coordinates, and the degrees of the coordinates of the product are upper 
bounded by the sum of the maximum degrees of coordinates of the factors, plus the maximum degree of the 
structure constants. 

Lemma 4.1. Let F' be a field. Let d be any non-negative integer if the characteristic of F' is zero, otherwise 
assume that d is not divisible by the characteristic ofW. Assume that F' contains a known primitive dth 
root of unity (,, and let X be a formal variable. Then a cyclic extension L having degree d over K := F'(A) 
can be computed using poly(d) arithmetic operations. L will be given by structure constants with respect to 
a basis, and the matrix for a generator of the Galois group of L/K in terms of the same basis will also be 
given. All the output entries (the structure constants as well as the entries of the matrix representing the 
Galois group generator) will be polynomials of degree poly(d) in F'[X]. Furthermore for F' = Q[v' / T] J the bit 
complexity of the algorithm (as well as the size of the output) is poly(d). 

Proof. Put L = F'(Yi) where X = Y d . Then 1, Yi,..., Y^ -1 are a F'(A)-basis for L with YfY( = Y( + ^ if 
i + j < d and XY( + ^ d otherwise. Further note that the linear extension cr of the map sending Y( to Cf Y( 
is a K(A)-automorphism of degree d. □ 

Remark 4.2. The construction above is known in the literature as a Kummer extension. When the char¬ 
acteristic is a prime p and a divisor of d, say d = p e d' where d 1 is prime to p, the Kummer extension K 
should be replaced by a cyclic extension which is a product of a Kummer extension of degree d! and a cyclic 
extension of degree p e described by Artin, Schreier and Witt [55]. Investigating the complexity of computing 
such extensions requires some further work. In [45] ■ we conduct such a research and present an efficient 
construction of such extensions. The consequence on results in this paper will be reported in ibid.. 

4.3 Constructing cyclic division algebras 

The following statement connects cyclic field extensions with central division algebras. It follows from 
Wedderburn’s theorem characterizing cyclic division algebras (see e.g. |50] Theorem (14.9)]) as shown on 
Page 221 of [50]. 

Fact 4.3. Let L be a cyclic extension of degree d of a field K. Let a be a generator of the Galois group, and Z 
a number transcendental over L. For the transcendental extension L (Z) of L, a extends to an automorphism 
(denoted again by a) ofl^(Z) such that the fixed field of a is K(Z). Thus L (Z) is a cyclic extension o/K(Z). 
Consider the K (Z)-algebra D generated by (a basis for) L and by an element U with relations U d = Z and 
Ua = a a U (for every a £ L(.2f), or, equivalently for every a from a fixed K -basis for ’h). Then D is a central 
division algebra of index d over K(Z). 

The following proposition is an algorithmic realization of Fact 14.31 

Proposition 4.4. Let L be a cyclic extension of degree d of a field K, and suppose that L is given by 
structure constants with respect to a K -basis A\,..., Ad- Similarly, a generator a for the Galois group is 
assumed to be given by its matrix in terms of the same basis. Let Y be a formal variable. Then one can 
construct a K (Y)-basis F of M(d,K(Y)) such that the K (Y d )-linear span ofT is a central division algebra 
over K(Y d ) of index d, using poly(d) arithmetic operations in K. 

Proof. Let Z = Y d . Let D be a central division algebra over K(Z) as in Fact 14.31 The existence of a 
K(Z)-subalgebra D' of M(d, K(Y)) isomorphic to D follows, e.g., from Theorem (14.7) of [50]. To construct 
a basis T for such a matrix algebra D' efficiently, note that ARJ^ , i,j = 1,..., d, form a K(Z)-basis of D. 
This is also a K(Y)-basis for the algebra D" = K(Y) <S)k(z) D. Consider also the element Uq = y®U. Then 
Uq = 1. As the elements Uq are linearly independent over K(Y) and hence over K (Z) as well, we have that 
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E = Uq + Uq + ... + Uq _1 + Uq is nonzero. As WE = Y d E, we have AiE (i = 1,..., d) form a K(F)-basis 
for the left ideal D"E of dimension d. Now the action of D" on this left ideal gives a matrix representation 
for D". Let 7 k ij be the structure constants for the multiplication of L (and of L (Z)): 

d 

A k Ai ='^2'Y kij A j (k,i = l,...,d). 
i=i 

Also, let Suj be the entries of the matrix of the £th power of the generator a of the Galois group: 

d 

Af = ^2 6 ^o A j (4 i = !> ■ ■ • > d )- 
j =1 

(Notice that the matrix (Suj)ij is the ItYi power of ( Siij)ij , whence the degrees of its elements are also 
bounded by poly(d).) Then 

d 

A k AiE — ^ ( 7 kijAjE 


U e AiE = Afu l E = AfY e E = Y e ^ SujAjE. 

i=i 

Thus the matrix of the action of A k has entries 7 k ij and the matrix of the action of U e has entries Y e duj- 
Then the action of U c A k can be obtained as the product of these two matrices. Let T consist of all such d 2 
products, and the proof is concluded. □ 

Combining Lemma 14. II and Proposition 14.41 we immediately obtain the following. 

Lemma 4.5. Let d, X, F', K = F'(X) and L be as in LemmaIn particular, if char(K) = p > 0 then 
p \ d. Then one can construct a F '(X,Y)-basis T of M(d,¥'(X,Y)) such that the ¥'(X,Y d )-linear span 
of T is a central division algebra over F'(A, Y d ) of index d, using poly(d) arithmetic operations in F'. In 
particular, for any A £ T, the entries of A are polynomials in F'[A', Y] of degree poly(d). Furthermore for 
F' = Q[v^T], the bit complexity of the algorithm (as well as the size of the output) is also poly(d). 

4.4 Some algorithmic issues for actual applications 

To put the above construction in action, we need to handle a few algorithmic problems as follows. 

4.4.1 Algorithmic issues when working with field extensions 

Lemma n~Tl assumes the field F' contains a known primitive dth root of unity where if char(F') = p > 0 then 
p \ d. In actual applications, we may start with a field F without a primitive dtli root of unity in it, and attach 
one symbolically, which we still denote by £. However, this may cause some problem. Namely, constructing 
F' = F[C] would require factoring the polynomial x d — 1 over F, a task which cannot be accomplished using 
basic arithmetic operations. To see that this is indeed an issue notice that a black-box field may contain 
certain “hidden” parts of cyclotomic fields. Of course, over certain concrete fields, such as the rationals, 
number fields or finite fields of small characteristics, this can be done in polynomial time. However, even 
over finite fields of large characteristic no deterministic polynomial time solution to this task is known at 
present. 

To get around this issue, one can perform the required computations over an appropriate factor algebra 
R of the algebra C = ¥[x\/{x d — 1) in place F' as if R were a field. To be specific, as d is not divisible by 
the characteristic, we know that C is semisimple - actually it is isomorphic to a direct sum of ideals, each 
of which is isomorphic to the splitting field F[v^T] of the polynomial x e — 1 for some divisor e of d , and the 
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projection of x to such an ideal is a primitive eth root of unity. It follows that if we compute the ideal J 
generated by annihilators of x e — 1, for all e a proper divisor of d, then R = C/J is isomorphic to the direct 
sum of copies of the splitting field F' of x d — 1, and the projection of x to each component is a primitive dth 
root of unity. And this property is inherited by any proper factor of R. A computation using R instead of 
F' may fail only at a point where we attempt to invert an non-invertible element of R. However, such an 
element must be a zero divisor. When this situation occurs, we replace R with the factor of R by its ideal 
generated by the zero divisor and restart the computation. Such a restart can clearly happen at most d — 2 
times. 

We explain what the above scheme entails in our actual tasks. 

As the methods for Proposition 14.41 and Lemma 14.51 do not require division, the zero divisor issue does 
not occur there. Replacing F' with R , the outcome T of Lemma 14. 5 1 will actually be a free R(X, U)-basis for 
an algebra which is a direct sum of isomorphic copies of a division algebra, embedded into M(d, R{X, Y)). 

Now consider the task of computing the rank of M(N,F'). Note that we cannot talk about the “rank” 
of matrices in M(N,R) which is not well-defined. But since I? is a direct sum of F', the decomposition of 
R induces a decomposition of M(N,R) into a direct sum of copies of M(N, F'). We call the images of the 
projections of a matrix B £ M(N,R) to the direct summands the components of B. The following lemma 
describes how to compute the maximum rank over the components. 

Lemma 4.6. Let R and F' be as above, and suppose we are given a matrix B £ M(N, R). Then there exists 
a deterministic polynomial-time algorithm that computes the maximum rank over the components of B. 

Proof. This can be achieved by combining division-free algorithms for computing the determinant by e.g. 
Kaltofen [49] (see also [53] for more such algorithms), and the parallel algorithm for computing the rank of 
a matrix by Mulmuley [5S|. 

We include a sketch here for completeness. To start with, instead of B we consider the symmetric matrix 

B' = (^p>T ■ Then let x and y be two formal variables. Form a matrix D = diag(l, y, y 2 ,... ,y 2N ~ 1 ), 

and compute det (xl — DB') using J3j5j, considered as a polynomial in F'(j/)[a;]. Let M be the maximum 
integer such that x M divides det(a;/ — DB'), and return (2TV — M)/2. 

By [55], the above procedure on a matrix from M(N, F') returns its rank. Now for B £ M(N,R), since it 
is (implicitly) a direct sum of several copies of M(N, F'), the above algorithm on B can be viewed as working 
with these components “in parallel”, and the resulting det(a;J — DB') is a direct sum of det(a;J — DB'f) 
where B[ are the components of B'. It is then not hard to deduce that the above procedure computes the 
maximum rank over the components of B. □ 

Remark 4.7. Using the method of Lemma 14.61 for rank computations, we will obtain an algorithm that does 
not require division in R at all, and hence we will not need the above mentioned restarts. Another possibility 
would be doing Gaussian elimination and restarting the computation once a zero divisor is met as described. 
If no zero divisors are met and the rank is r, then it means that the columns of B generated a free module 
over R of rank r, so each component is also of rank r over F'. 

Finally, we note that a similar issue, namely that a black box field may even contain infinite algebraic 
extensions of its subfields has been circumvented by using the transcendental extension K = F'(X) in the 
construction of cyclic extensions (Lemma 14. II) . 

4.4.2 Computing the rank of matrices over a rational function field in few variables 

Note that the matrices from Lemma T4. 5 1 are matrices over a rational function field. Therefore we will need 
to compute the rank of matrices in such form. 

Proposition 4.8. Let F' be a field and K = F'(Xi, X 2 ,..., Xj.) be a pure transcendental extension of F'. Let 
A be an N x N matrix with entries as quotients of polynomials from F'[Ab, X 2 ,..., X^], where the polynomials 
are explicitly given as sums of monomials. Assume that the degrees of the polynomials appearing in A are 
upper bounded by D. If |F'| = (ND) n ( k \ then we can find in time ( ND)°^ a matrix B £ M(N,F') with 
rk (B) = rk(A). 
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In particular, if k is a constant - k = 2 as used in Lemma 15.31 for the procedure in Lemma 15.71 - then the 
above procedure runs in polynomial time. 

Proof. We multiply the entries of A by an easily computable common multiple (e.g., the product) of their 
denominators to obtain a matrix with polynomial entries from F'[A'i, X 2 , ■ • •, A*,]. The data describing this 
matrix has size polynomial in the size of the input data. In particular, the degree of the determinant of any 
sub-matrix is upper bounded by a polynomial s in ( ND) k . We have assumed that |F'| = ( ND ) n ( fc ). Then 
from ( ND )°( k 1 specializations by elements of a subset of size s + 1 of F', at least one gives a matrix with 
entries F' having the same rank as the original matrix. Thus the rank of A can be computed by computing 
the rank of ( ND)°^ matrices over F'. □ 

Note that if we simulate F' using R as described in Section 14.4.11 then we shall apply the procedure in 
Lemma 14.61 after specializing the variables as in Proposition 14.81 

5 Finding a nonsingular matrix in blow-ups 

In this section we describe, given B < M(n, F), how to compute a nonsingular matrix in B^ = M(d, F) ®B 
for some d < (n + 1)!, or certify that this is not possible. 

One important note is due here: as per our notation, matrices in M(d, F) g M(n, F) are viewed as block 
matrices, where each block is of size n x n. This is more convenient when describing semi-invariants. In 
this section, it will be more convenient to work with M(n, F) g M(d, F), namely each block is of size d x d. 
This is consistent with other parts simply because M(d, F) g M(n, F) = M(n,F) g M(d, F). Therefore, we 
identify M(nd, F) = M(n, F) g M(d, F), and fix such a decomposition. Recall the notation B(U), and B ^ 
from Section l2Jl 

After some preparation, we prove the main technical lemma called the regularity lemma for blow-ups. 
Then we prove Theorem 15. Ill fthe formal version of Theorem ll.4l) . and Corollaries 11.51 and 11.61 follow easily. 

5.1 Preparations 

A characterization of blow-ups. 

Proposition 5.1. For A < M(dn, F), A = B W for some B < M(n, F) if and only if (7 g M(d,¥))A(I g 
M{d,¥))=A. 

We remind the reader that (I g M(d,¥))A(I g M(d, F)) = {XAY : X £ / g M(d, F), A £ A,Y £ 
I g M(d, F)}. Note that multiplication of tensor products of matrices obeys the rule (Xi g Yi)(X 2 g Y 2 ) = 
X- l X 2 fg>YY 2 . 

Proof. The only if part is obvious. To see the reverse implication, for i,j £ [d]. let E ^ stand for the 
elementary matrix in M(d, F) in which the (i,j) th entry is 1 and the others are zero. Then for every 
quadruple £ M] 4 we have EkiEi'j>Ejk = Sa/Sjjil, where stands for the Kronecker 

delta. Any element A oi A can be written as ® Eij with Bij £ M(n, F). Then for every 

i, j £ [d] we have 5Z fc=1 (7 g Eki)A(I g Ejk) = Bij g /, which implies that B^ g / are in A. So define B as 
{B £ M (n. F) : such that Bgl £ A}, and we see that A = B^ d \ □ 

Note that (7 g M(d,¥))A(I g M(d, F)) = A is equivalent to saying that A is an M(d,¥) sub-bimodule 
of M(n, F) g M{d , F), where we identify M{d, F) with I g M(d, F). Similarly, for a subspace W < F” g F d , 
one can see that W is of the form Wo g F d if and only if (7 g M(d, F))Wo = Wo, that is, Wo is an M(d, F)- 
submodule of F" g F d . 
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Shrunk subspaces in the blow-up situation. 

Proposition 5.2. If A = Bhas an s-shrunk subspace, then A has an s'-shrunk subspace where s' > s 
such that d divides s', and B has an s' / d-shrunk subspace. 

Proof. As A = B ^, (I 0 M(d, F) )A(I 0 M(d, F)) = A. Assume that U is an s-shrunk subspace of A: with 
W = A(U) we have dimpl/ — dinip W = s. Then (I 0 M(d,¥))W = (/ ® M(d,¥))AU = AU = W, thus 
W = Wq 0 F d for some Wq < F". Similarly, as A(I 0 M(d, F)) = A, we have A(1 0 M(d, ¥))U = W, whence 
U' = (1 0 M(d,¥))U is an s'-shrunk subspaces with s' > s, and U' = t/o0F d with some Uo < F". Note that 
dim(W), dim(f7'), and therefore s', are all divisible by d. Noting A = B 0 M(d, F), we have Wq < B(Uq) 
and so Uq is an s'/d-shrunk subspace. □ 

From the extension field to the original field. Assume that for some extension field K of F we are given a 
matrix A' £ B 0f K < M(n, K) of rank r. Then, if |F| > r, using the method of [18,, Lemma 2.2], we 
can efficiently find a matrix A £ B of rank at least r. This procedure is also useful to keep sizes of the 
occurring field elements small. For completeness we include a brief description. Let SCF with IS] = r + 1 
and let B\, ...,£?£ be an F-basis for B. Then A' = a[B i + ... + a' t Bg, where a' £ K. As A' is of rank r, 
there exists an r x r sub-matrix of A with nonzero determinant. Assume that a\ £ S. Then we consider 
the determinant of the corresponding sub-matrix of the polynomial matrix xB\ + a' 2 B 2 + ... a' e B^. This 
determinant is a nonzero polynomial of degree at most r in x. Therefore there exists an element a\ £ S such 
that a\B\ + a' 2 B 2 + ... a' e Be has rank at least r. Continuing with a ' 2 ,..., a' e , we can ensure that all the af s 
are from S. Since the Bf s span B , the resulting matrix of rank at least r is in B. We record this too as a 
fact. 

Lemma 5.3 (Data reduction, [TB] Lemma 2.2]). Let B < M(k x f,F) be given by a basis B \,..., B m , and 
let K be an extension field of F. Let S be a subset of F of size at least r + 1. Suppose that we are given a 
matrix A' = ^ a\Bi £ B 0f K of rank at least r. Then we can find A = ^ aiBi £ B of rank also at least 
r with m £ S. The algorithm uses poly (k,£,r) rank computations for matrices of the form 'fZa'fBi where 
a'l £ {ai,.. .,a' m } U S. 

5.2 Regularity of blow-ups 

Our goal in this subsection is to prove that when the field size is large enough, the maximum rank over 
A = B^ < M(dn,¥) is always divisible by d. The proof is constructive when char(F) = 0, or when 
char(F) \ d: if we get a matrix in A of rank at least rd + 1, we will be able to construct a matrix in A of 
rank at least (r + l)d. This is the main technical tool to be used in the proof of Theorem 15.1 f I 

We first present a version of the regularity lemma in which a matrix division algebra as in Lemma 14.51 is 
assumed to be part of the input. 

Lemma 5.4 (Regularity of blow-ups, technical version). Assume that we are given a matrix A £ B^ < 
M(dn, F) with rk(A) = (r— l)d+k for some 1 < k < d. Let X andY be formal variables and put M. = ¥'(X), 
where F' is a finite extension of F of degree at most d. Suppose further that |F| > ( nd ) n W and that we are also 
given a K (Y)-basis T of M(d, K(F)) such that the K(T d )-linear span ofT is a central division algebra D' over 
K(F rf ). Let 6 be the maximum of the degrees of the polynomials appearing as numerators or denominators of 
the entries of the matrices in T. Then, using (nd + d) 0 ^ 1 -* arithmetic operations in F, one can find a matrix 
A" g g{d} T[\t(A") > rd. Furthermore, over Q the bit complexity of the algorithm is polynomial in the 
size of the input data (that is, the total number of bits describing the entries of matrices and the coefficients 
of polynomials). 

Proof. Instead of the blow-up B^ d,d ^ = B0 M (d, F) we start with the blow-up B0M (d, K(F)). Then both 
£{d,d} and B D r 

span, as a K(F)-linear space, the blow-up B 0f M(d,K(Y)). 

Claim 5.5. Every matrix in M(n,¥) 0 D' C M(d, K(Y)) has rank (as a matrix over K(F)) divisible by d. 
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Proof. Firstly note that M(n, F) ®f D' = M(n,K(Z)) ®k(z) D' , since D' is a K(Z)-algebra. As D’ C 
M(d,K.(Y)), M(n,K(Z)) ®k(z) b ' acts naturally on the K(Z)-space K(Z) n ® K(E) d = WL(Z) n ® K (Z) d = 
K(Z) nd2 . Since D’ ®k(z) D' op = M(d 2 ,K(Z)) [50j Corollary 15.5], it follows that the centralizer of this 
action is isomorphic to the opposite algebra D' op . Therefore the image A'K(F )* 1 of any A ' £ M(n, F) ®f D' 
is a H ,op -submodule, whence its dimension over K(Z) is divisible by d 2 . It follows that the dimension over 
K(F) is divisible by d. □ 

The claim enables us to “round up” the rank of A to the next multiple of d. Let Bi ,..., be an F-basis 
of B. Since rk(A) > (r — l)d over F, clearly rk(A) > (r — l)d over K(E) as well. Now T, the K(Z)-basis 
for D' is a K(E)-basis for M(d,K(Y)). Therefore A , as a matrix over K(F), can be expressed as a linear 
combination (with coefficients over K(Y)) of {Bi ®C:i£ [d],C G T}. We use the method of Lemma [5.31 
to find coefficients from K(Z) (or even from F) such that the combination A! of the basis element for D' 
has rank also larger than (r — 1 )d. We have A! £ B ® D' , whence by Claim [5751 the rank of A' is at least 
rd. Then we express A' as a linear combination of elements - with coefficients from K(Y) - of an F-basis of 
gftMl which is also a K(F)-basis for B ® M(d,K(Y)). Then we use again the algorithm of Lemma [5.31 to 
replace these coefficients to elements of F to find a matrix A" £ B of rank at least rd. □ 

Using Remark 14.21 and Lemma H~5l we immediately obtain the following results. 

Lemma 5.6 (Regularity of blow-ups, non-constructive). For B < M(n, F), assume that |F| > (jzd)^ 1 ). 
Then rk(I3'f d ^) is divisible by d. 

Lemma 5.7 (Regularity of blow-ups, constructive). For B < M(n, F) and A = B^, assume that cha,r(F) = 
0 or char(F) \ d, and |F| > (nd) 0 *- 1 -*. Then, given a matrix A £ A with rkA > (r — 1 )d, there exists a 
deterministic algorithm that returns A £ A of rank > rd. This algorithm uses poly(nd) arithmetic operations 
and over Q, all intermediate numbers have bit lengths polynomial in the input size. 

To see all the ingredients together, we give an expanded description in Algorithm [T] 


Algorithm 1: Algorithm RoundUpRank 

Input: B = <Bi, ...,B m )< M(n,F). SCF with |S| = ( nd) a W. A £ fiW of rk(A) > (n - 1 )d. If 
char(F) = p > 0 then p \ d. 

Output: A" £ B^ with rk(A") = nd. 

1 F 7 £- F[£], where ( <— a dth root of F; 

2 X U Y ■£- two independent formal variables; 

3 T = {Ci,..., Cd 2 } <— a F'(Xi, y)-basis of M(d, F'(Xi, y)) such that F'(Ai, F d )-linear span of T is a 
central division algebra over F'(Ai,y d ); 

4 Expand A = Ylie[m],je[d 2 ] \jBi®Cj, gF'(Ai,F) as a matrix in B® M(d,¥'(Xi,Y)); 

5 Compute A! = Y^ie[m],je[d 2 } Tij B i ® C,-, p,ij £ S such that rk(A') =nd> rk(A); 

6 A = {Eij | i,j £ [d]} t— the standard basis of M(d,F'(Xi,Y)); 

// That is Ei j(i, j) = 1, and other entries are 0 

7 Expand A = ® Ej ,fc, Wyfc ^ ^ (-^iA )i 

s Compute A" = J2ie[ m \,j,ke{d] ® E jik , &j,k G S such that rk(A") = rk(A'); 

9 return A". 


We remind the reader how our previous development supports Algorithm [T] For Line 1, Section 14.4.11 
describes how to work with F[£]. Line 3 calls the procedure in Lemma 14.51 which in turn utilizes Lemma l4.ll 
Line 4 and 7 are standard tasks in linear algebra. Line 5 and 8 rely on Lemma 15.31 which in turn calls 
Proposition [478] Lemma f4.6l ensures that the rank calculation is essentially correct. 

We mention an immediate consequence. 
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Corollary 5.8. Let B < M(n, F). Assume that the characteristic of F is zero, d > n, and we are given a 
matrix A G B W with rkA = rd. Then, for any d' > d we can efficiently find A G B^ d ^ with rk A > rd'. 

Proof. Induction on d! and d. Assume first that d' = d+1. Then whence (ci+l)(r — 1) < 

dr. Therefore, if we embed B^ into B^ d+1 ^ then A has rank rd > (r — 1 )(d + 1) and Lemma [5.71 applies. 
Proceed with (d + 2, d + 1) in place of (d + 1, d). □ 

We close this subsection with two open problems. 

Remark 5.9. 

1. It is an interesting problem to investigate whether (the non-constructive version of) the regularity 
lemma would hold over small fields. Giving a proof not relying on division algebras might be a 
progress in this direction. 

2. Corollary 15.81 probably remains true for d < n. However, we do not see how to prove it. 

5.3 Incrementing rank via blow-up 

We have introduced in Section 12.21 a key technique here, namely the second Wong sequences. Recall that 
given A G B < M(n,¥), the second Wong sequence can be used to detect whether there exists a cork(A)- 
shrunk subspace. If such a shrunk subspace exists, then ncrk(£>) = rk(A). The difficulty is the case when 
such cork(A)-shrunk subspace does not exist. One natural idea to proceed is to find A! G B of rank > rk(A), 
and use A’ to test whether a cork(A')-shrunk exists or not. If B is rank-1 spanned, such an A' does exist, 
and another ingredient in |44j is an update procedure that finds A! of higher rank in this case. However, this 
in general is not possible, since rk(Z?) and ncrk(£?) may differ. Fortunately, the concept of blow-ups helps: 
instead of looking for A! G B of rank > rk(A), we shall look for A' G B ^ of rank > rk(A)d for some not 
too large d G N. It turns out that this is achievable, and an application of the regularity lemma even yields 
A" G B W of rank > (rk(A) + 1 )d. 

Theorem 5.10. Let B < M(n,¥) and A = B^. A ssume that we are given a matrix A G A with rk(A) = rd. 
Let d' he an integer > r. Suppose that |F| is (nddand if char(F) = p > 0 then assume p \ dd'. There 
exists a deterministic algorithm that returns either an (n — r)d-shrunk subspace for A (equivalently, an 
(n — r)-shrunk subspace for B), or a matrix A* G A ® M(d', F) of rank at least (r + 1 )dd'. This algorithm 
uses poly {ndd') arithmetic operations and, over Q, all intermediate numbers have bit lengths polynomial in 
the input size. 

Proof. We present the algorithm formally as Algorithm [2j 

Let us first outline what the algorithm does. From Line 1 to 6 it computes the second Wong sequence with 
respect to {A, A). Line 7 and 8 deal with the case when the sequence provides a cork(A)-shrunk subspace. 
In the other case, we first utilize the sequence to get a matrix A" of rank > rdd' (Line 9 to 25). Then we 
obtain the desired A*, by applying the regularity lemma (Lemma [577]) to A". 

We now explain some implementation details of the algorithm. 

Line 3, i < r + 1 The second Wong sequence, when applied to matrix spaces of the form A = B^ d \ stabi¬ 
lizes faster because of the following. Since (7®M(d, F)).4 = A , at stage j we have (7®M(ci, ¥))AWj = 
AWj , whence the dimension of AWj is divisible by d for every j. It follows that, until stabilization, 
the dimension of AWj increases by at least d and so the sequence stabilizes to its limit in at most r + 1 
steps when applied to A G A of rank rd. 

Line 9 To compute Cf s, one perform the following. Take a basis of A. Search for a basis element Y 
such that YA~ 1 (AA~ 1 ) e ~ 1 ( 0) % im(A). Put Ci = Y and search for a basis element Y such that 
CcA~ l YA- l (AA~ i y~ 2 {0) <2 inr(A). Continue this iteration and the desired G^’s can be computed. 
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Algorithm 2 : Algorithm IncrementRank 


Input: B = (Bi,... ,B m ) < M(n, F). A £ A = B ^ of rk(A) = rd. An integer d! > r such that if 
char(F) = p > 0 then p \ d'. SCF with |,S| = (ndd') n ^. 

Output: One of the following: (1) An (n — r)d-shrunk subspace of A; (2) A* £ A^ d ^ = B^ dd ^ with 
rk(A*) > (r + l)dd' . 

1 W 0 , W* £- 0; 

2 £ = 0 ; 

3 while W* C irn(A) and f < r + 1 do 


£<=£ + 1 ; 

Wi ■£- AA~ 1 (Wi-i)\ 
W* = We. 


7 if W* C im(A) then 

8 |_ return A~ l (W*). 

//In the following £ is the smallest integer i such that Wi % im(A). 

9 Compute Ci,..., Cg £ A such that Ct A _1 C^_iA _1 ... CiA _1 (0) % im(A); 

10 Take v\ £ A _1 (0) = ker(A) such that CiA~ 1 Ci-\A ~ 1 ... Ci(ui) ^ im(A); 

11 for i = 2 to £ do 

12 |_ Vi £- A _1 C l _ 1 (u i _ 1 ). 


// Note that Ctvi ^ im(A). 

13 {u\,..., Ud’} <— a basis of ¥ d ; 

14 for i = 1 to £ do 

is if i = r + 1 = d' then 

16 L Zi £- the matrix in M[d' , F) such that Z r+ i(u r +i) = U\ and Z r+ \(uj) = 0 for j ^ r + 1 . 


17 else 

18 L Zi ■£- the matrix in M(d', F) such that Zi(ui) = Ui +1 and Zi(uj ) = 0 for j ^ i. 


19 A' £- A 0 /; 

20 C' i — Ci 0 Z\ T • • • T Ci C) Zi j 

21 if rk(C') > rdd! then 

22 |_ A" £- C'. 

23 else 

24 Compute A £ S such that A' + A C' is of > rdd !; 

25 |_ A" £- A' + A C'. 

26 Compute A* of rank > (r + 1 )dd' using A" and Lemma [5771 

27 return A*. 
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That Algorithm [5] runs in the stated time bound follows easily from Fact 12.11 and Lemma o To see 
the correctness of the algorithm, it remains to prove that in Line 21 to 25 we do obtain A" of rank > rdd'. 
Consider the vectors w\ = w t = v t ®u t . We now observe that: (1) w\ € ker A'; (2) A'wj = C'vjj-i 

for j = 2,.. .,i; (3) Cw t = C t v t <g> u t+ 1 g (AF" d ) <g) F d '; as AF" d <8> F d ' D A'F ndd ', we have C’w t (£ A'F ndd '. 
This means that the limit of the second Wong sequence for the pair ( A', (A 1 , C')) runs out of the image of 
A!. By Fact 12.21 A! is not of maximum rank in (A',C), and Line 21 to 25 just describe a straightforward 
method to obtain a matrix of highest rank in a 2-dinrensional matrix space. □ 

An iteration based on Theorem 15.101 proves the following Theorem 15.Ill Note that in Theorem 15. 101 d! 
can be chosen as either r + 1 or r + 2, depending on which is not divisible by char(F). 

Theorem 5.11. Suppose we are given B := (Bi,..., B m ) < M(n, F), and A £ B with rk(A) = s < n. 
Let d = (n + l)!/(s + 1)!, and assume that |F| = f l{nd). Then there exists a deterministic algorithm, that 
computes a matrix B E B 0 M(d ', F) of rank rd! for some d! < d and, if r < n, an (n — r)-shrunk subspace 
for B. The algorithm uses poly(ro,<i) arithmetic operations, and when working over Q, has bit complexity 
polynomial in n, d and the input size. 

Now Corollary 11.51 and 11.61 follow easily. To see Corollary 11.51 note that if we choose a matrix in B 
randomly, it will be of maximum rank. Using that matrix as A in Theorem 15.111 Corollary 11.51 is proved. 
For Corollary II.61 if B has no shrunk subspace, a full-rank matrix will be certainly present in M(d!, F) <g) B 
for some d! < (n + 1)!, giving us the upper bound on a(R(n,m)). 
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A Derksen’s bound applied to R(n, m ) 

Proof. We just need to indicate certain parameters for the matrix semi-invariants that are used in Derksen’s 
bound. 

Suppose a group G acts on a vector space V rationally, and let R be the resulting invariant ring. Theorem 
1.1 in m shows that the degree bound is upper bounded by max(2,3/8 • s ■ cr(R) 2 ), where a is the degree 
bound for defining the nullcone, and s is the dimension of R. 

s is upper bounded by the number of variables. Therefore for R(m, n), s < mn 2 < n 4 . 

To bound a, we use Proposition 1.2 in M- Recall that G as an algebraic group, is defined by a system 
of polynomial equations in zi,...,Zt- For example, SL(n,F) x SL(n,F) is defined by det(A) = 1 and 
det(F) = 1, where X and Y are n x n variable matrices. The action of G is rational, so it can be recorded 
as p : G —> GL(E) by g — > ( o-i,j(g))i,jedim.(y)t where each cqj is a polynomial in z\, ..., Zt- 

<j(R) is then upper bounded by where t is the number of variables used to define G as above, 

to = dim(G), H is the maximum degree over polynomials defining G, and A is the maximum degree over 
polynomials defining the action. So for R(n,m), t = 2n 2 , to = 2n 2 — 2, H = n, and A = 2. It follows that 
a(R(n,m)) < n 2 ■ 2 2n ^~ 2 . 

Therefore R(n , to) is generated by elements of degree < 3/128 • n 8 • 16 n . □ 
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